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Abstract. The question of the state-size cost for simulation of two-way nondeter- 
ministic automata (2nfas) by two-way deterministic automata (2dfas) was raised 
in 1978 and, despite many attempts, it is still open. Subsequently, the problem was 
attacked by restricting the power of 2dfas (e.g., using a restricted input head move- 
ment) to the degree for which it was already possible to derive some exponential 
gaps between the weaker model and the standard 2nfas. Here we use an opposite 
approach, increasing the power of 2dfas to the degree for which it is still possible to 
obtain a subexponential conversion from the stronger model to the standard 2dfas. 
In particular, it turns out that subexponential conversion is possible for two-way au- 
tomata that make nondeterministic choices only when the input head scans one of the 
input tape endmarkers. However, there is no restriction on the input head movement. 
This implies that an exponential gap between 2nfas and 2dfas can be obtained only 
for unrestricted 2nfas using capabilities beyond the proposed new model. 

As an additional bonus, conversion into a machine for the complement of the 
original language is polynomial in this model. The same holds for making such 
machines self- verifying, halting, or unambiguous. Finally, any superpolynomial lower 
bound for the simulation of such machines by standard 2dfas would imply L ^ NL. 
In the same way, the alternating version of these machines is related to L = NL = P , 
the classical computational complexity problems. 
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1. Introduction 

The cost, in terms of states, of the simulation of two-way nondeterministic automata (2nfas, 
for short) by two-way deterministic automata (2dfas) is one of the most important and 
challenging open problems in automata theory and, in general, in theoretical computer 
science. This problem was proposed in 1978 by Sakoda and Sipser [SS78j . who conjectured 
that the cost is exponential. However, in spite of all effort, exponential gaps were proved 
only between 2nfas and some restricted weaker versions of 2dfas. 
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In 1980, Sipser proved that if the resulting machine is required to be sweeping (deter- 
ministic and reversing the direction of its input head only at the endmarkers, two special 
symbols used to mark the left and right ends of the input), the simulation of a 2nfa is 
indeed exponential [SipSOb] , However, Berman and Micali |Ber801 fMicSl] proved indepen- 
dently that this does not solve the general problem: in fact the simulation of unrestricted 
2DFAS by sweeping 2dfas also requires an exponential number of states. The Sipser's result 
was generalized by Hromkovic and Schnitger |HS03] . who considered oblivious machines 
(following the same trajectory of input head movements along all inputs of equal length) 
and, recently, by Kapoutsis |Kaplla| , considering 2dfas with the number of input head 
reversals that is sublinear in the length of the input. However, even the last condition gives 
a machine provably less succinct than unrestricted 2dfas, and hence the general problem 
remains open. 

Starting from 2003 with a paper by Geffert et al. |GMP03] . a different kind of restriction 
has been investigated: the subclass or regular languages using a single-letter input alphabet. 
Even under this restriction, the problem of Sakoda and Sipser looks difficult, since it is 
connected with L = NL, an open question in complexity theory. (L and NL denote the 
respective classes of languages accepted in deterministic and nondeterministic logarithmic 
space.) First, in |GMP03] . a new normal form was obtained for unary automata, in which 
all nondeterministic choices and input head reversals take place only at the endmarkers. 
Moreover, the state-size cost of the conversion into this normal form is only linear. This 
normal form is a starting point for several other properties of unary 2nfas. First, in the same 
paper, each n-state unary 2nfa is simulated by an equivalent 2dfa with 0(nr^°^2("+^)+^T ) 
states, which gives a subexponential but still superpolynomial upper bound. It is not known 
whether this simulation is tight. However, a positive answer would imply the separation 
between the classes L and NL. In fact, under assumption that L = NL, each unary 2nfa with 
n states can be simulated by a 2dfa with a number of states polynomial in n [GPll) . After 
a minor modification (without assuming L = NL), this gives that each unary 2nfa can be 
made unambiguous, keeping the number of the states polynomial. (For further connections 
between two-way automata and logarithmic space, we address the reader to [BL77t Kapllbj .) 



Along these lines of investigation, in |GMP07] . the problem of the complementation 
for unary 2nfas has been considered, by proving that each n-state 2nfa accepting a unary 
language L can be replaced by a 2nfa with 0(n®) states accepting the complement of L. The 
proof combines the above normal form for unary 2nfAs with inductive counting arguments. 

Kapoutsis |Kap06| considered the complementation in the case general input alphabets, 
but restricting the input head reversals. He showed that the complementation of sweeping 
2nfas (with the input head reversals only at the endmarkers) requires exponentially many 
states, thus emphasizing a relevant difference with the unary case. 

In this paper, we use a different approach. Instead of restricting the power of 2dfas 
to the degree for which it is already possible to derive an exponential gap between the 
weaker model and the standard 2nfas, we increase the power of 2dfas, towards 2nfas, 
to the degree for which it is still possible to obtain a subexponential conversion from the 
stronger model to the standard 2dfas. Such new stronger model then clearly shows that, 
in order to prove an exponential gap between 2nfas and 2dfas, one must use capabilities 
not allowed in the proposed new model. More precisely, in our new model, we neither 
restrict the cardinality of input alphabets, nor put any constraint on the head movement. 
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i.e., head reversals can take place at any input position. On the other hand, we permit 
nondeterministic choices only when the input head is scanning one of the endmarkers. We 
shall call such machine a two-way outer-nondeterministic finite automaton (2onfa). 

It turns out that this machine has its natural counterpart also in the case of two-way 
alternating finite automata (2afas), which is a two-way outer- alternating finite automaton 
(2oafa), making universal and existential choices only at the endmarkers. (For recent 
results on 2afas, see iKap09[ iGefllj .) 

We show that several results obtained for unary 2nfas can be extended to 2onfas, 
and some of them even to the alternating version, 20AFAs, with any input alphabet. In 
particular, we prove the following: 

• Each n-state 20NFA can be simulated by a halting two-way self-verifying automa- 
ton (2SVFA) |DHRS97] with 0{n^) many states. This fact has two important impli- 
cations: 

* The complementation of 2onfas can be done by using a polynomial number 
of states. Note the contrast with the above mentioned case of sweeping 2nfas 
studied in |Kap06| . 

* Each 20NFA can be simulated by a halting 2onfa using a polynomial number of 
states. 

• Each n-state 20NFA can be simulated by a 2dfa with 0(n^°^2"+6) states. 

• If L = NL, then each n-state 2onfa can be simulated by a 2dfa with a number of 
states polynomial in n. Hence, a super polynomial lower bound for the simulation of 
20NFAs by 2dfas would imply L ^ NL. (Unlike in (BL77] . there are no restrictions 
on the length of potential witness inputs.) 

• Each n-state 2onfa can be simulated by an unambiguous 20NFA with a polynomial 
number of states. 

• If L = P, then each n-state 20AFA can be simulated by a 2dfa with a number 
of states polynomial in n, with the same consequences as presented for L = NL. 
(P denotes, as usual, the class of languages recognizable by deterministic Turing 
machines in polynomial time.) 

• Similarly, if NL = P, we get the corresponding polynomial conversion from 20AFAs 
to 2nfas. 

These results are obtained by generalizing the constructions given in |GMP03| IGMPOTf 
IGPll] for the unary case. However, here we do not have a normal form for simplifying 
the automata, by restricting input head reversals to the endmarkers. Our generalization 
rely on a different tool, presented in the first part of the paper. Basically, we extend 
some techniques developed originally for deterministic devices |Sip80a[ [GMP07| to machines 
with nondeterminism at the endmarkers. This permits us to check the existence of certain 
computation paths, including infinite loops, by the use of a linear number of states. 

The paper is organized as follows. In Section [2] we recall basic definitions and preliminary 
results required later. In Section[3l after introducing 2onfas, we develop a fundamental tool 
that will be used several times, namely, a deterministic procedure that allows us to check 
the existence of computation paths between two given states in the given 20NFA, starting 
and ending at the left endmarker and do not visiting the endmarkers in the meantime. 
This procedure is also useful to make all computations halting. The next sections are 
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devoted to our main results. In Section HI we present the polynomial simulation of 2onfas 
by 2svFAs and its consequences. In Section [5l we show the subexponential simulation of 
20NFAs by 2dfas. Then, in Section El under the assumption L = NL, such simulation is 
with a polynomial number of states. Furthermore, we show how to simulate a 2onfa by an 
unambiguous 2nfa using a polynomial number of states. Finally, in Section [71 we present 
the corresponding results for the alternating case. Some concluding remarks are briefly 
discussed in Section [8l 

2. Preliminaries 

Let us start by briefly recalling some basic definitions from automata theory. For a detailed 
exposition, we refer the reader to |HU79j . Given a set S, \S\ denotes its cardinality and 2^ 
the family of all its subsets. 

A two-way nondeterministic finite automaton (2nfa, for short) is defined as a quintuple 
A = {Q,T,,6,qi,F), in which Q is a finite set of states, S is a finite input alphabet, 6 : 
Q X (S U H}) — >■ 2*^^''^""'^''''"^^^ is a transition function, where l-,H ^ S are two special 
symbols, called the left and the right endmarkers, respectively, qi (z Q is an initial state, and 
F C Q is a set of final states. The input is stored onto the input tape surrounded by the two 
endmarkers, the left endmarker being at the position zero. In one move, A reads an input 
symbol, changes its state, and moves the input head one position forward, backward, or 
keeps it stationary depending on whether 6 returns +1, —1, or 0, respectively. The machine 
accepts the input, if there exists a computation path from the initial state Qi with the head 
on the left endmarker to some final state q £ F. The language accepted by A, denoted 
by L{A), consists of all input strings that are accepted. A is said to be halting if each 
computation ends in a finite number of steps. 

Observing that if an accepting computation visits the same endmarker two times in the 
same state then there exists a shorter accepting computation on same input, we immediately 
get the following lemma, which will be used in the proofs of some of our results: 

Lemma 2.1. // a 2nfa A with n states accepts an input w, then it also accept w with a 
computation that visits the left (right) endmarker at most n times. 

Throughout the paper, given a 2nfa A, we will call computation segment (or just seg- 
ment) from p to q on w, each computation path on an input w that starts at the left 
endmarker in the state p, ends at the left endmarker in the state q and never visits the same 
endmarker in the meantime. 

A sequence of t > segments from a state p to a state q on input is a sequence of 
segments such that there are states po,pi, ■ ■ ■ ,Pt, with pQ = p, Pt = q, and the ith segment 
is from pi^i to pi, i = 1, ... ,t. 

A 2nfa a is said to be deterministic (2dfa), whenever \5{q,a)\ < 1, for any q G Q 
and 0" € S U {l-,H}, A is called unambiguous (2ufa), if there exists at most one accepting 
computation path for each input. A two-way self-verifying automaton (2svfa) ^ is a 2nfa 
which, besides the set of accepting states F C Q, is equipped also with a disjoint set of 
rejecting states C Q. For each input vu € L{A) there exists one computation path 
halting in a state q G F, and no path may halt in a state q & F"^ . Conversely, for w ^ L{A) 
there exists one computation path halting in a state q £ F^, and no path may halt in a 
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state q (z F. Note that some computation paths of a 2svFA may end with a "don't-know" 
answer, by ending in one state not belonging to -F U F"^, or entering into an infinite loop. 
An automaton working over a single letter alphabet is called unary. 

We assume that the reader is familiar with the standard Turing machine model and the 
basic facts from space complexity theory. For more details, see e.g. |HU791 ISze94j . Our 
Turing machine (both deterministic and nondeterministic) is equipped with a finite state 
control, a two-way read-only input tape, and a separate semi-infinite two-way read-write 
worktape, initially empty. Such machine is s{n) -space bounded, if no computation uses more 
than s(n) worktape cells, for each input of length n. 

The class of languages accepted in s{n) < O(logn) space by deterministic Turing ma- 
chines is denoted by L, while the corresponding classes for nondeterministic machines by NL. 

3. Outer-nondeterministic automata and the subroutine REACH 

In this section we introduce the computational model investigated in the paper, and we 
develop some preliminary results concerning it. First, we provide a simplification of the 
model with respect to the definition. After that, we will present the main result of this 
section. We will show how to detect the existence of computational segments in our devices. 
This result will be a fundamental tool, used in the proof of all the other results presented 
in the paper. 

A two-way outer-nondeterministic finite automaton (20NFA, for short) is a 2nfa A = 
{Q,Ti,5,qi, F) that can take nondeterministic decisions only when the input head is scanning 
one of the two endmarkers, i.e., for each g E Q, a G S, a)| < 1. Actually, with a linear 
increasing in the number of the states, we can further restrict the use of the nondeterminism 
to the left endmarker only. We can also obtain some other restrictions, which will be useful 
to simplify our proofs: 

Lemma 3.1. For any n-state 20NFA A = (Q, S, 5, qi,F) there exists an equivalent 20NFA A' 
with no more than 3n states that satisfies the following properties: 

(i) nondeterministic choices are taken only when the input head is scanning the left 
endmarker, 

(ii) there is a unique accepting state qp and this state is also halting, 

(iii) qp is reachable only at the left endmarker by stationary moves, 

(iv) stationary moves can occur only at the left endmarker to enter the state qp. 

Proof: If -F = then A accept the empty language and it can be replaced by the trivial 
automaton with two states and qp, and without any transition. So, from now on let us 
suppose F 7^ 0. 

First, we can make each final state also halting, by making the transition function 
undefined from final states. Furthermore, each stationary move leading to a final state can 
be replaced by a transition to the same final state that moves the head either to the left or 
to the right. Now, by inspecting the transition function, for each q G Q and a G E U {h, H}, 
we compute the set of pairs {p, d) such that A in the state q scanning an input square 
containing the symbol a, after a possible sequence of stationary moves, will finally reach the 
state p moving the head in the direction d G {—1,1}. Stationary moves from the state q 
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with input a are replaced by all possible transitions described by those pairs {p,d). Notice 
that if a is an ordinary input symbol, i.e., a ^ {l-,H}, then we can have at most one such 
a pair. Hence, after this transformation, nondeterministic choices remain confined at the 
endmarkers and the resulting automaton does not perform any stationary move. 

Now, from the automaton A so modified, we build an automatom A' making nonde- 
terministic choices only on the left endmarker. Normally, A' makes the same moves as A. 
However, when the head reaches the right endmarker in a state q, A' traverses the input 
from right to left, using a copy V of to reach the left endmarker. At this position, the 
automaton simulates a transition from q at the right endmarker, i.e., it chooses a state p 
such that {p, — 1) G 6{q, H) and, using a copy ^ of p, it traverses the input from left to right 
until to reach the right endmarker. Here, A' moves its head one position to the left entering 
the original state p. From this configuration A' resumes the simulations of the ordinary 
moves of A. 

Finally, to accept in qp on the left endmarker, we make two further small changes. First, 
when A' enters a state p G F, it starts to move its head to the left using the state V> until 
to reach the left endmarker. At that position, from V) -4' finally enters qp without moving 
the input head. Furthermore, when A', scanning the left endmarker, should move in a state 

for some p £ F, then it enters directly qp without moving the input head. This second 
change permit us to save the states '0, for p F. 

It can be easily seen that the resulting automaton A' is equivalent to the given 2onfa A 
and satisfies the properties listed in the statement of the lemma. The set of states consists 
of 3 copies of the set Q plus the state qp. However, for each final state p of A' , the copy ~^ 
is useless. Hence, the total number of states of A' does not exceed 3re. □ 

The remaining part of this section is devoted to develop a tool which will be fundamental 
in the proof of our results. Given a 20NFA A with n states, we show the existence of a 
subroutine Reach that receives as parameters two states g', q" of A and decides whether or 
not A has a computation segment from q' to q" on an input string w. We will show that this 
subroutine can be implemented using a deterministic finite state control with 0{n) states 
whose input w is stored on a two-way tape. 

At a first glance, we could try to compute REACH(g', q") by initializing the automaton A 
in the state q' with the input head at the left endmarker and by stopping its computation 
as soon as the input head, in one of the following steps, reaches again the left endmarker, 
then testing whether or not the state so reached is q" . However, this approach presents two 
problems: first of all, the original automaton A could enter into a infinite loop, never coming 
back to the left endmarker; second, the first move from the state q' on the left endmarker 
can be nondeterministic. 

In order to solve the first problem, we adapt the construction given in |GMP07] to 
transform each n-state 2dfa into an equivalent halting 4n-state 2dfa, which, it turns, was 
a refinement of the construction obtained by Sipser |Sip80a| to make space bounded Turing 
machines halting. We give a brief outline. For each w G E*, a deterministic machine accepts 
w if and only if there is a "backward" path, following the history of the computation in 
reverse, from the unique accepting configuration cj to the unique initial configuration cq. In 
our setting, a "configuration" is a pair (s, i), where s G Q is a state and i G {0, . . . , + 1} 
is an input head position. 

Consider the graph whose nodes represent configurations and edges computation steps. 



6 



If the machine under consideration is deterministic, the component of the graph containing 
the accepting configuration c/ is a tree rooted at this configuration, with backward paths 
branching to all possible predecessors of cy. In addition, if the accepting configuration is 
also halting, no backward path starting from c/ can cycle (hence, it is of finite length). 
Thus, the equivalent halting machine can perform a depth-first search of this tree in order 
to detect whether the initial configuration cq belongs to the predecessors of cj. If this is 
the case, then the simulator accepts. On the other hand, if all the tree is examined without 
reaching cq, this means that there are no paths from cq to Cf and so w is not in the language. 
Hence, the simulator rejects. 

We adapt such a procedure by choosing cq = (g',0) and Cf = {q",0), where q' and q" 
are the two parameters. Furthermore, since we are interested to detect the existence of just 
one segment from q' to q" , we do not consider the transitions on the left endmarker from 
states different than q' . However, the remaining transition, i.e., that from configuration 
Co, can be nondeterministic. Thus, we need further modifications. Suppose 6{q',\-) = 
{{qi,+l), . . . , (g/c+l)}. Hence, there is a segment from cq to cj if and only if there is a 
path from Cj = (qj,!) to c/, for some j G {!,... ,A:}, visiting the left endmarker only in 
Cf. Hence, the backward search can be done starting from Cf, without considering all the 
transitions of the original automaton from the left endmarker, stopping and accepting when 
one of the Cj's is reached, or rejecting when all the tree has been visited without reaching 
any of the cj's. 

To do that, our procedure needs to detect when the head position of the original 2onfa 
A is scanning the first "real" input symbol, i.e., that immediately to the right of the left 
endmarker, in cell number 1. This can be done as follows. We say that a left predecessor for 
the configuration {s,i) is a configuration {t,i — 1), with the input head scanning a symbol 
a, such that (s,-|-l) G 5{t,a). Hence, in the construction of the backward path from {q",0) 
to {q',0), the configuration {q',0) can be reached only as a left predecessor of some {qj, 1), 
jE{1,...,/c}. By a closer look to the simulation in [GMP07] . we can observe that all the left 
precedessors of a configuration (s, i) are examined in a copy s\ of the state s, with the input 
head one position to the left of the actual position of the original machine, i.e., in position 
i — 1. Hence, when the simulating machine reaches one of the states qi\, ■ ■ ■ ,qk\, with 
the input head scanning the left endmarker at the same time, it can stop the computation 
and accept. There are only two points which depend on the states q',q": the choice of the 
initial state, i.e., of the configuration c/, and the detection of configurations cj = {qj, 1) of 
A, when the head of A' is scanning the left endmarker. 

This construction can be implemented by a deterministic finite state control with in — 3 
states (not counting the space needed to store the parameters q',q", which will be used in 
a "read-only" way). More details are presented in Appendix Rl Hence, with this strategy 
we obtain the following result which will be a fundamental tool in the next sections: 

Lemma 3.2. For each 20NFA A with n states (in the form of Lemma \3.1\) . it is possible 
to construct a deterministic finite state control with 4n — 3 states that given an input string 
w G S* stored on a two-way tape and two states q', q" of A decides whether or not the 
automaton A has a computation segment from q' to q" on w. 
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4. Simulation by halting self-verifying automata 



In this section we prove that for each n-state 2onfa A accepting a language L there exists an 
equivalent halting 2svFA using a polynomial number of states and making nondeterministic 
choices only when the input head is scanning the left endmarker. As a consequence, we can 
derive halting 2onfas with polynomial many states that accept L and the complement of L. 

Even in this case, our starting point is a proof given in |GMP07] for the unary case, 
which was based on the well known technique of the inductive counting. However, there 
are deep differences. In particular, the proof in |GMP07] uses a normal form for unary 
2nfas in which a computation is a sequence of deterministic traversals of the input from 
one endmarker to the opposite one. In this normal form there are no parts of computations 
starting and ending at the same endmarker without visiting the other one in the meantime 
(these parts are usually called U-turns). Furthermore, the only possible infinite loops involve 
the endmarkers and can be easily avoided using Lemma 12.11 The simulation inductively 
counts, for increasing values of t, how many states are reachable from the initial state in t 
traversals of the input and, as a side effect of this counting procedure, also lists these states. 
In this way, after a certain number of traversals, all the states which are reachable at the 
endmarkers have been listed and, hence, it is possible to decide whether or not the input 
was accepted by the original machine. 

In the case we are considering, we do not have such a kind of normal form. Hence, a 
computation can present traversals of the input from one endmarker to the opposite one as 
well as [/-turns. Furthermore, a computation can reject by entering into a infinite loop: in 
this case infinite loops which never visits the endmarkers are also possible. To overcome the 
first problem, instead of traversals, our inductive counting procedure considers computation 
segments: for increasing values of i, it counts how many states are reachable from the initial 
state in a computation consisting of t segments, i.e., visiting the left endmarker t + 1 times, 
and, at the same time, the procedure lists these states. Since, as stated in Lemma 12. 1^ 
each accepted input has an accepting computation which visits the left endmarker at most 
n times, it is enough to consider computations consisting of at most n — 1 segments. This 
avoid infinite loops involving the endmarkers. We will discuss later on how to deal with the 
remaining infinite loops, namely those that do not involve endmarkers. 

In some steps, the simulation we present can reach the "don't-know" state q-,. We 
always assume that this state is halting: hence, when it is reached, even in the code of the 
subroutines, the entire simulation will be aborted. 

To simulate single computation segments, we will make use of the subroutine Reach, 
discussed in Section [3l Furthermore we use another subroutine tREACH which with param- 
eters q Q and t > verifies the existence of sequence of t segments from the initial state qi 
to q on the input under consideration. In negative case, tREACH aborts the simulation in 
the state q-,, otherwise the subroutine returns to the main simulation in a different state. 
(As we will see, this subroutine is nondeterministic. It can also halt in the state q-,, due 
to a wrong sequence of nondeterministic guesses. However, if there exists a sequence of t 
segments from qi to q, the subroutine has at least one computation ending in a state other 
than q-,.) 
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Algorithm 1: Simulation of 20NFAs by 2svFAs 



1 m := 

2 for t : 

m 



1 



to IQI - 2 do 
= to'; to' := 
foreach q' G Q do 
for i := 1 to m do 

q := a, nondeterministically chosen state 
if (i > 1 and q < gprcv) then halt in q, 

Qprcv • — Q 

if tREACH(g,t) and REACH(gpi.ov, g') then 

// possible side effect of tREACH : abort in q, 
if q' = qp then hah in qy^s 
to' to' + 1 
break 
hah in qno 



The algorithm proceeds by counting, for t = 0, . . . , |Q| — 2, the number of states reachable 
by A at the left endmarker by all computation paths starting from the initial configuration 
and consisting of exactly t+1 segments (loop from line[2]). During this process, the algorithm 
also generates all states reachable at the left endmarker, and hence it can correctly decide 
whether to accept or reject the given input. 

At the beginning of the t-th iteration of this loop (line [2]) , a variable m' contains the 
number of states reachable at the left endmarker by all computation paths with exactly t 
segments. (In line [H we prepare m' = 1 for t = 0, the only state reachable by segments 
being the initial state qi.) In lineO we save the "old" value of m' in the variable m, and 
clear m' for counting the number of states reachable upon completing one more segment, 
i.e., with exactly t + 1 segments. 

The value of m' is computed in the loop from line [H for each state q' £ Q, we test 
whether or not it is reachable by a path with exactly t + 1 segments. If it is, we increment 
the value of m'. 

To decide whether q' can be reached by exactly t + 1 segments, we generate all m states 
that are reachable at the left endmarker by all computation paths of exactly t segments 
and we verify if q' can be reached by any of these states by a single segment. This part is 
realized by the innermost loop (from lined]) 

The loop generates in a nondeterministic way all the sequences of m states. To avoid to 
consider a same set of m states twice, or to consider sequences containing repeated states, 
the sequences that do not respect a preliminary fixed order on Q are discarded, aborting 
the computation in q? (line[7I). As a side effect of the call tREACH(g, t) (line [9]), among the 
remaining sequences the procedure filters the only one consisting of the m states reachable 
in exactly t segments, discarding the remaining. In fact, if the subroutine does not find a 
sequence of t segments from qi to q, then it aborts the computation in q-?. (Further details 
about this subroutine, which is also nondeterministic, are given below.) Hence, the only 
computation which "survive" is the one generating, according to the fixed order, all m 
states reachable in exactly t segments. For each q among these m states, the algorithm tests 
whether the state q' under examination is reachable from q in one step (call REACH(gprevi q') 
on line [9l for implementation reasons, the first parameter in the call is ^prev which was 
initialized in line [8] with the value of q). If also the result of this test is positive, then the 
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variable m! is incremented. At this point we can abort the innermost loop (line [T2]) and 
continue with the next iteration, if any, of the loop from line HI to state the reachability in 
t + 1 segments of another state. Furthermore, if during this process we discover that the 
final state is reachable at the left endmarker, we stop the computation and accept by halting 
in the state q'yes- 

On the other hand, if the iteration of the outermost loop has been completed for each 
i = 0,...,|Q| — 2, never reaching at the left endmarker, then the input is not accepted 
by the original automaton. (Otherwise, the search would have stopped already, in linellOi) 
Therefore, in line[l3l we stop in the rejecting state ^no- 

It is not hard to see that: (i) if the input is accepted by A, at least one computation 
path halts in the state (/yes, and no path halts in qno, (ii) if the input is rejected, at least one 
path halts in gnoi and no path halts in qycs- (iii) Due to wrong sequences of nondeterministic 
guesses, some computation paths halt in q-,, but no path can get into an infinite loop. 

Finally, we briefly discuss a possible implementation of the subroutine tREACH (for 
further details see Appendix [B]). Firstly, we can modify the backward seach described in 
Section [3l to obtain subroutine that given as parameter a state q" outputs a nondetermin- 
istically chosen state q' such that A has a computation segment from q' to q" on the given 
input w. This subroutine, called uReach, can also end in the state g?, aborting the entire 
computation, if it is not able to find such a state q' . This can be done using 4n — 4 states, 
besides g?. The implementation of tREACH(g, t) consists of a loop of t iterations. In each one 
of them, uReach is called on the parameter q, assigning to it the result. Hence, at the end, 
if q contains the initial state then the search was successful and tREACH returns the control 
to the caller, implicitly giving a positive answer, otherwise it aborts the entire computation 
in q-,. Instead of this approch, we could use, as in [GMPOTj subroutine simulation], a direct 
nondeterministic simulation of A, in order to generate all the states reachable in exactly 
t segments. This will also produce a correct inductive counting procedure. However, due 
to the fact that A can enter some infinite loop that does not visit the endmarkers, the 
resulting procedure could also enter some loop. Our implementation produces an halting 
automata, obtaining, as shown below, the same upper bound for the number of the states, 
as the implementation in [GMP07] for the unary case. 

The number of possible values of each one of the 6 variables m, m', t, q, q' , qprcv is bounded 
by n + 1. The subroutine tREACH uses one counter bounded by n and 0(n) states to run 
uReach, besides the "global" variable q, hence O(n^) states. The implementation of Reach 
uses 0(n) states, however, it can recycle part of the space used by tREACH. Hence, all the 
simulation can be carried out using O(n^) states. 

Finally, we also observe that in the main algorithm and in the subroutines all the non- 
deterministic choices can be taken when the input head is scanning the left endmarker. 
By summarizing, we have proved the following: 

Theorem 4.1. Each n-state 2onfa can be simulated by an equivalent halting 0{n^)-state 
2SVFA making nondeterministic choices only when the input head is scanning the left end- 
marker. 

Corollary 4.2. For each n-state 20NFA A there exist an equivalent halting 20NFA A! with 
0{n^) states and a 2onfa A" with 0(n®) states accepting the complement of the language 
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accepted by A. 



5. Subexponential deterministic simulation 

In this section, we prove that each 2onfa with n states can be simulated by an equivalent 
2dfa with 0(n'°S2"-+6^ states, i.e., with a subexpontential, but still superpolynomial, num- 
ber of states. In the authors' knowledge this is the first case of a model using nondeterminism 
and an unrestricted alphabet, having a subexponential simulation by 2dfas. 

This result generalizes a result proved for the unary case in [GMP03] . Actually, even the 
proof is very similar: the new "ingredient" in our version is the subroutine REACH presented 
in Section [3j So we give a very short presentation, addressing the reader to |GMP03] for 
further details. 

Let ^ be a 2onfa with n states in the form of Lemma 13. li The 2dfa simulating A 
implements a recursive function, called Reachable, based on the well known divide-and- 
conquere technique. 

The function receives three parameters: two states q and p and an integer t > 1, and 
returns a boolean. On these parameters, REACHABLE(g,p, t) returns true if and only if on 
the input w under consideration the automaton A has a sequence of at most t segments 
from the state q to the state p. Hence, according to Lemma [2. 11 to decide whether or not w 
is accepted by A, we call REACHABLE(gi, qp,n — l). The pseudocode of the function is given 
below. We point out that is the base of the recursion, i = 1, we have to verify the existence 
of a sequence of at most 1 segment from q to p. A sequence with segments is possible if 
and only if q = p. The existence of a sequence of just one segment can be verified using the 
subroutine Reach. 



Algorithm 2: Recursive function Reachable((7,p, t) 



14 if t = 1 then return {q = p or Reach(<7,p)) 

15 else 



16 
17 
18 



foreach state r e Q do 

if REACHABLE(g, r, \t/2']) then 
|_ if REACHABLE(r,p, \t/2']) then return true 
return false 



We can prove the following: 

Theorem 5.1. Each n-state 2onfa can he simulated by an equivalent 2dfa with 0(n'°^2 "+6) 
states. 

Proof: The implementation of REACHABLE and its complexity analysis are very close to 
those given in |GMP03] for the unary case. We just outline a rough estimation of the state 
upper bound. 

First, we suppose that the given 2onfa A is in the form given in Lemma 13.11 The 
implementation of the function REACHABLE can be done using a constant height stack, as 
in [GMP03] . with the following differences: 

• the height of the stack is [log2(n — 1)] (instead of [log2(re + 1)]), 
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• the subroutine Reach uses 4n — 3 state (the corresponding subroutine in |GMP03] 
was implemented with n? + 3 states). 
This leads to an upper limit for the number of different stack configurations 4n(2n)r^°S2("-i)l 
(for the sake of simplicity, we deleted some negative terms, so it could be possible to give a 
slightly more accurate upper bound), which is bounded by 4nr'°S2("-i)l+2_ jf ^j^e automaton 
A is not in form of Lemma 13.11 we need to convert it using 3n states. Hence, by replacing 
n by 3n in the above upper bound, we obtain the following rough upper estimation: 

4(3n)ri°g2(3n-l)l+2 < 36 . . (3j^)riog2(3n-l)l 

< 36-n2.9n2.nri°S2("-i/3)+log2 3l 
= 0(n'°S2(")+^) 

We can observe that c < 6. □ 

It is natural to wonder if the upper bound stated in Theorem 15. II is optimal. We remind 
the reader that the best known lower bound for the number of the states of 2dfas simu- 
lating n-state 2nfas is O(n^) [Chr86j . In the next section we will show that the optimality 
of the upper bound in Theorem 15.11 or any other superpolynomial state lower bound for 
the simulation of 2onfas by 2dfas would imply the separation between deterministic and 
nondeterministic logarithmic space, hence solving a longstanding open problem in structural 
complexity. 

6. Conditional and unambiguous simulations 

In this section we discuss how to reduce the language accepted by a given 2onfa to the graph 
accessibility problem (GAP), i.e., the problem of deciding whether a directed graph contains 
a path connecting two designated vertices. This problem is well known to be complete for 
NL, the class of languages accepted by O(logn) space bounded machines. As a consequence 
of this reduction we will prove that the equality between the classes L and NL would imply 
a polynomial simulation of 2onfas by 2dfas. Furthermore, we also prove that each 2onfa 
can be made unambiguous with a polynomial increasing in the number of the states. An 
extension of this reduction to the alternating case will be discussed in Section [71 

Let us start to present our reduction. As for the results in Sections [H and [5l it is 
obtained by combining a technique developed for the unary case [GPllj . with the use of the 
subroutine Reach presented in Section [3l 

From now on, we consider an n-state 20NFA A in the form of Lemma 13.11 With each 
input string w € T,* we associate the directed graph G^w) = {Q,E{w)), where Q is the set 
of states of A and 

E{w) = {(p, g) £ Q X Q \ RBACn{p,q) is true}, 

i.e., it is the set of pairs (p, q) of states such that A on input w has a segment from p to q. 

It should be clear that w is accepted by A if and only if the graph G{w) contains a 
path from vertex qi, the initial state, to vertex qp, the accepting state. Hence, this defines 
a reduction from the language accepted by A to GAP. 
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As mentioned before, GAP is complete for NL under logarithmic space reductions |Sav70] . 
Hence, GAP G L if and only if L = NL. This permit us to prove the following^ 

Theorem 6.1. // L = NL then each n-state 2onfa can be simulated by a 2dfa with a 
number of states polynomial in n. 

Proof: Let us denote by L be the language accepted by the given n-state 2onfa A and 
by Dgap a deterministic machine which solves GAP in logarithmic space. Such a machine 
exists, under the hypothesis L = NL. 

We can build a machine M deciding whether or not a string w belongs to L by composing 
the reduction G above described with the machine -Dqap (see Figure [T]). 
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,yes 



^no 



w 



G 


G{w) 




r 



ryes 



^no 



Figure 1: The machines M of Theorem l6.ll (on the left) and Mu of Theorem 16.21 (on the right). 

The input of -Dgap is the graph G{w), more precisely its adjacency matrix, which can 
be encoded with 0{n'^) bits. Hence, Dgap uses O(logn) space on its worktape. This space 
can be represented in a finite state control with a number of states polynomial in n. (Notice 
that since the automaton A is fixed, n does not depend on the length of the input w.) 

On the other hand, the adjacency matrix of G{w) cannot be stored in the finite control 
of M, because it would require a number of states exponential in n. Hence, each time -Dgap 
needs to access one symbol from its input, such a bit is computed "on the fly". The bit 
represents a position in the adjacency matrix, corresponding to a pair (p, q) of states of A. 
Its value is 1 if and only if the automaton A has a segment from p to q on input w. Hence, it 
can be computed by calling Reach(p, q), the subroutine presented in Section [3l which uses 
4n — 3 states. The proof can completed as in [GPll^ Lemma 4.1], where a similar result has 
been proved for the unary case. □ 

While the deterministic simulation in Theorem 16. II is under the condition L = NL, the next 
simulation by unambiguous machines does not require any extra condition: 

Theorem 6.2. Each n-state 2onfa can be simulated by an unambiguous 20NFA with a 
number of states polynomial in n. 

Proof : In [RAOOj it was proved for each nondeterministic machine M working in logarithmic 
space there exists an equivalent unambiguous machine M' still working in logarithmic space, 
which uses of a polynomial advice [KL82j, i.e., of a family of strings {a{n) | n > 0} of 
polynomial length. The machine M' receives, besides an input string x, the advice string 
corresponding to the length of x, i.e., the string a(|x|). Hence, there exists an unambiguous 
machine Uqap of this kind that, with an appropriate advice, solves GAP. 

Given a 2onfa A with n states, by applying a construction similar to that in the proof of 
Theorem l6.H we can obtain an unambiguous two-way automaton Mu with a number of states 
polynomial in n that recognize the same language as A. Essentially, in the construction we 

^The result in Theorem 16. H is presented, in a different context, also in [KPllj . 
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have to replace the machine -Dgap by the machine Ugap with the appropriate advice (see 
Figured]). However, the advice depends only on the size of the input of Uqap, namely on 
the size of the graph G{w), which, in turns, depends only on the number of states of A, not 
on the input string w. So, given the 2onfa A, the advice is fixed, hence it can be encoded 
in the "hardware" of M„. 

Finally, we observe that Mu accesses its input tape only to compute the bits of the 
adjacency matrix of G{w), i.e., when the subroutine Reach is running. This subroutine is 
deterministic. It starts and ends its computation with the head at left endmarker. Hence, 
when the machine Uqap takes nondeterministic decisions the head is scanning the left 
endmarker. This permit us to conclude that the unambiguous machine is a 2onfa. □ 



7. The alternating case 

In this section we briefly discuss an extension of the technique used in Section [H to the case 
of automata with alternations [CKSSlj , recently considered in |Kap09 , IGefllj . 



A two-way alternating automaton (2afa, for short) is defined as a quintuple A = 
{Q,T,,6,qi, F), exactly as a 2nfa. However, the set Q is partitioned in two sets and 
Qv) the sets of existential and universal states, respectively. The acceptance of an input 
string w is witnessed by an accepting computation tree T. The nodes of T are labeled by 
configurations, the root is labeled by the initial configuration, the leaves are labeled by ac- 
cepting configurations (we can assume that accepting configurations are also halting). Each 
node of T labeled by a configuration c whose state is existential has exactly one son. The 
label of this son is one of the possible successors of c. Each node of T labeled by a config- 
uration c whose state is universal has one son for each possible successor of c. Notice that 
nondeterministic automata are just alternating automata without universal states. 

Even for 2afas, we can restrict the use of the nondeterminism as we did for 2nfas, 
considering outer- alternating finite automata (20AFAs). In these models, each configuration 
scanning an ordinary input symbol can have at most one successor, namely (existential or 
universal) nondeterministic choices can be taken only when the head is scanning one of the 
endmarkers. Actually, we can further restrict the use of the nondeterminism only at the left 
endmarker. In particular, using arguments similar to those of Lemma |3.H we can prove the 
following result H 

Lemma 7.1. For any n-state 20AFA A = {Q, S, 5, qi, F) there exists an equivalent 20AFA A' 
with no more than 3n states that satisfies the following properties: 

(i) nondeterministic choices are taken only when the input head is scanning the left 
endmarker, 

(ii) there is a unique accepting state q^ and this state is also halting, 

(iii) qp is reachable only at the left endmarker by stationary moves, 

(iv) stationary moves can occur only at the left endmarker. 



^Notice a small difference with respect to Lemma l3.ll Stationary moves are possible only at the left 
endmarker. However, here they can be used also to reach states different than qf. Due to the presence of 
both universal and existential states, the removal of these moves would require an argument more complicated 
than the simple one used to prove Lemma [3.11 However, for our purposes, we do not need to remove them. 
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Now, we consider the alternating graph accessibility problem (AGAP, for short), an alter- 
nating version of GAP. The instance of the problem is an alternating direct graph, i.e., a 
graph G = {V, E) with a partition of V in two sets V3 and V\j, and two designated vertices 
s and t. The question is if the predicate apath{s,t) is true, where, for x,y gV, apath{x,y) 
holds true if and only if: 

• X = y, 01 

• X E V3 and exists z with (x, z) ^ E such that apath{z, y) is true, or 

• X G Vv and for all z € V, {x, z) G E implies that apath{z, y) is true. 

This problem is known to be complete for the class P, with respect to logarithmic space 
reductions |Imm81j . 

As is Section [6l we can reduce the language accepted by a given 20AFA A to AGAP, by 
associating with each input string w the graph G{w) = {Q,E{w)), where {p,q) € E{'w) if 
and only if A has a computation segment from p to q on input w. (The extension of the 
notion of computation segment to 2AFA is obvious.) Since the subroutine Reach presented 
in Section [3] depends only on the transition function of the given automaton A and not on 
the acceptance condition, we can use it to detect segments even in the case of outer 2afas|1 

These permit us to prove the following result: 

Theorem 7.2. // L = P then each 20AFA can be simulated by a 2dfa with a polynomial 
number of states. 

Proof: Since AGAP is complete for P, if L = P then there exists a deterministic ma- 
chine -Dagap that in logarithmic space solves AGAP. Applying the same construction of 
Theorem 16. H just replacing the machine -Dqap by the machine Dagap, given an n-state 
20AFA A, we can define an equivalent 2dfa with a number of states polynomial in n. □ 

In a similar way, we are able to prove the following: 

Theorem 7.3. //NL = P then each 20AFA can be simulated by a 2nfa with a polynomial 
number of states. 

Proof: Under the hyphothesis NL = P, there exists a nondeterministic machine A'agap 
that in logarithmic space solves AGAP. By replacing in the previous proof the machine 
-Cagap by the machine A^agap we obtain a 2nfa with a number of states polynomial in n 
equivalent to the given 20AFA. □ 



8. Concluding remarks 

In this paper we generalized, in a unified framework, some results previously obtained for 
unary 2nfas to machines with arbitrary input alphabets, but making nondeterministic 
choices only at the input tape endmarkers. Among others, we have shown that any su- 
perpolynomial lower bound for the simulation of such machines by standard 2dfas would 

^Due to the possibility of stationary moves on the left endmarker, the subroutine Reach needs just one 
small change (line [45] in the implementation presented in Appendix Given the two parameters q' and q" , 
it preliminary checks if q' = q" or if q" is reachable from q' by a stationary move on the left endmarker. If 
the answer is positive, it immediately accepts, otherwise, it works as we described. 
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imply L 7^ NL. (Unlike in |BL77| . there are no restrictions on the length of potential 
witness inputs.) In Section [71 we also related the alternating version of such machines to 
L = NL = P, the classical computational complexity open problems. 

Comparing our results with those obtained for other restricted models of two-way au- 
tomata we observe that: 

• Actually, unary 2nfas can use only a restricted form of nondeterminism. In fact, we 
can restrict their nondeterminism to the endmarkers without increasing significantly 
their size |GMP03j . (A similar phenomenon has been observed by Chrobak in the 
case of unary one-way automata |Chr86j .) 

• In the general case, the possibility of reversing the input head movement at any in- 
put position does not seem so powerful as the possibility of making nondeterministic 
decisions at any input position. (Compare our polynomial upper bound for the com- 
plementation of 20NFAs with the exponential lower bound for the complementation 
of sweeping 2nfas in |Kap06| .) 

• However, in the deterministic case, the possibility of reversing the input head at any 
input position can make automata exponentially smaller than machines reversing the 
input head only at the endmarkers |Ber801 IMicSl] . 

It would be interesting to see if the results proved in this paper could not be extended 
to a model using the nondeterminism in a less restricted way than the one considered here. 
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A. The subroutine REACH 



Let ^ be a 2onfa in the form of Lemma |3. 11 Fixed q' , q" € Q, we first describe a 2dfa A! 
which halts on every input and accept a word w if and only if A on input w has a computation 
segment from q' to q" . Subsequently, we will discuss how to modify the construction to use 
q' and q" as parameters. 

First, we examine some trivial cases: 

• \{ q' = q" then A! can immediately accept. 

• \i q' ^ q" and 5{q' , h) = then A' can immediately reject. 

• \iq' ^ q" and q" = qp then if (gp, 0) E 5{q' , h) then A' immediately accepts, otherwise 
it immediately rejects. 

Now, we consider the remaining cases. Since the construction we present is obtained by 
modifying that given in |GMP07] for making 2dfas halting, the presentation will be given 
along the same lines and by keeping, as much as possible, the same notations. 

First, let us fix a linear order on the state set of the original automaton. As usual, the 
symbols "<" and ">" denote the ordering relation. We do not need to further consider the 
final state qp and stationary moves. In fact, the only possible stationary moves are those 
reaching qp on the left endmarker, qp cannot be reached in any other input position and it 
is halting (see Lemma l3.ip . All these possibilities have been already considered in the case 
of segments ending in qp. 

Our implementation of the depth-first search examines each configuration (q, i) in two 
modes: 

• Mode 1: Examination of the "left" predecessors of (g, i), namely configurations {p,i — 
1) such that (g, +1) G S{p,a), 

• Mode 2: Examination of the "right" predecessors, namely configurations {p, i + 1) 
such that {q, —1) G 5{p,a). 

For each q ^ Q and both modes, we introduce a starting and a finishing state. So our 
machine will use the following set of states: 

Q' = {^\, Qii, qy, qU-.qeQ- {qp}} U {qy}- 

These states are interpreted as follows: 

q\ Starting state for the Mode 1: examination of left predecessors for the configuration 
{q,i)- Left predecessors will be examined one after another, according to the linear 
order induced by the relation "<". To inspect the content of the input square i — 1, 
the simulator (if it is in the state q\) has its input head one position to the left of 
the actual position of the original machine A in configuration (q, i). 

qii Finishing state for the Mode 1. All the left predecessors of {q, i) have been examined, 
but we still have to examine the right predecessors of {q, i). In the state qii, the input 
head of the simulator is in the actual position, i.e., the position i. 

qy Starting state for the Mode 2, examination of right predecessors for {q,i), when the 
left predecessors have been finished. The right predecessors will also be examined in 
the linear order induced by "<". In the state g/", the simulator has its input head 
one position to the right of the actual position of the configuration (g, i), to inspect 
the symbol a in the input square i + 1. 
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qi2 Finishing state for the Mode 2. Both the left and the right predecessors of (g, i) have 
been examined. In the state ^4,2, the input head of the simulator is in the actual 
position, i.e., the position i. 
qy Halting accepting state. 
Let us now describe the transition function 6' : Q' x (Y^U {h, H}) Q' x {—1, 0, +1} of A' 
implementing this strategy. For each (type of) state r € Q' and each symbol a G S U {h, H}, 
we first display a procedure that assigns a value of (5'(r, a) G Q' x {— 1, 0, +1} to the transition 
table and, after that, we present an explanation for this procedure. Note that the simulator 
will use stationary moves. The reader should also keep in mind that the procedures displayed 
below are not executed by the machine A' but, rather, they are used to fill in the entries 
in the transition table for A'. In the description, we will suppose that the input string is 
w = W1W2 ■ ■ ■ Wn, with Wi € T,, ioi i = 1, ... ,n. 



Transition 6'{q\,a) 



20 if a ~-\ then d'{q\,a) :— undefined 

21 else if a =h then 

22 if (q, +1) e 6{q', h) then S'iq-K, a) := (gy , 0) 

23 else S'{q\, a) -.^ {qii,+l) 

24 else if there is no p G Q : S{p,a) = {{q,+l)} then S'{q\,a) := 

25 else 

26 q := mm{p e Q : S{p,a) ^ {{q,+l)}} 

27 S'{q\,a) := (q'K.,-!) 



This part presents several differences with the corresponding one in |GMP07] . Recall that 
A' gets to the state q\ when, for some i, it starts the examination of the left predecessors 
of the configuration {q,i), that is, of configurations {p,i — 1) such that {q,+l) G Wj-i). 
By definition of q\, A' has its input head already at the position i — 1. The procedure 
considers four cases: 

• a =H (line I20p : actually this case is unreachabale, it is given just for completeness, 
to fill in all entries in the transition table for 5' . 

• a =\- (from line I22p : in this case the original machine A is scanning the first input 
cell, i.e., that containing wi. There are two possibilities depending on the first 
parameter q'. If from the state q' on the left endmarker it is possible to move right 
in the state q, i.e., {q, +1) G S{q' , h), then the backward simulation of the segment is 
completed. Hence, the machine halts and accept (line [22]). Otherwise, since we are 
interested just in one segment, we ignore the left predecessors of {q, 1), since they 
correspond to moves from the left endmarker, i.e., to the starting of segments others 
than the one we are interested in. The machine immediately enters in the ending 
state of Mode 1 qu, moving right the input head, to reach the real position in the 
simulated machine (line [23]). 

• There are no left predecessors (line [24]): the machine ends Mode 1 (notice that, since 
we already considered a =\-, in this case and in the next one the transitions we have 
to consider are deterministic). 

• In the remaining case (lines [2611270 . there exists at least one left predecessor of 
configuration {q,i). We select the first one in the linear order and start to examine 
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this configuration with the same method. To this aim, we switch the state to q\, 
and move the head one position to the left of i — 1. 



Transition 6'{qii,a) 

28 if a T^H then S'{qu,a) ■— (^y, +1) 

29 else S'{qu,a) := {qi2,0) 



This case is exactly as in [GMP07] . In this state, the examination of the left predecessors 
of (q, i) has been completed. Hence, the search continues with the examination of the right 
predecessors in the Mode 2 (line [28]) , by switching to the state q/' and moving the head to 
the position i + 1. If the input head is already on the right endmarker, i.e., a =H, then 
the configuration {q,i) does not have any right predecessors (line I29p . Hence, by switching 
to qi2, we finish the Mode 2 immediately, as if all right predecessors had been searched. 



Transition 6'{q/-,a) 



30 if a =\- then d'{qy^,a) := undefined 

31 else if there is no p € Q : S{p, a) — {{q, —1)} then S'{q/', a) ;= {qu, — 1) 

32 else 



33 
34 



q := mm{p £ Q : S{p,a) = {{q,-l)}} 
5'{q/',a) := {q\.,-l) 



Even this case is as in [GMP07] . In the state q/', A' starts to examine right predecessors of 
{q,i), i.e., configurations {p,i + 1) such that 6{p,Wi+i) = {q, —1). A' has its head already at 
the position i + We observe that in a right predecessor of a configuration, the head cannot 
scan the left endmarker. Hence, all transitions from right predecessors are deterministic. 
There are three main cases: 

• a =\- (line [30]): unreachabale case given for completeness. 

• There are no right predecessors (line [3T]l : we finish the Mode 2 immediately, which 
completes the search for {q,i). 

• Otherwise (lines [33ti34|) we select {q,i + 1), the first right predecessor of {q,i), and 
start to examine it with the same method. (Among others, the left predecessors of 
{q, i + 1) are going to be examined.) To this aim, we switch to q\, and move the 
head one position to the left of i + 1. 



Transition 6'{qi2,a) 



35 if a —\- then S'{qi2,a) := undefined 

36 else if S{q,a) — then 6'{qi2,a) := undefined 

37 else 

(r, d) unique element of d{q, a) 

if there is no p £ Q : p > q and d{p, a) = {{r, d)} then 
if d = +1 then d'{qi2, a) := (ni, +1) 
else S'{qi2, a) :— {ri2, —1) 
else 

q :— mm{p £ Q : p > q and 5{p, a) = {(r, d)}} 
S'{qi2,a) := (q\, -1) 
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This state concludes the examination of the configuration (q, i), and all configurations in the 
subtree rooted at {q,i). The machine A' has its head at the position i, the actual position 
of the head of the simulated machine A. There are three main cases: 

• The head is scanning the left endmarker (line [35]) : since in the backward simulation 
we ignore the transitions from the left endmarker (see line [23] in the definition of 
d'{q\,a)), the only reachable configuration of the form {q,0) that can be reached 
is {q",0), namely, the configuration from which the backward simulation started. If 
this happens, it means that we have examined all the tree rooted at {q",0), never 
accepting. Hence, a computation segments from {q',0) to {q",0) on input w does not 
exists and A' have to reject. This can be done by leaving the transition undefined: 
the computation stops in a state different from the accepting state qy- 

• The configuration does not have any successor (line [36]) : a such configuration is never 
reached in the backward search. This case is included only for completeness. 

• In the remaining case (from line [38]) , since the head is scanning a position different 
from the left endmarker, the configuration {q,i) has as unique successor {r,j), which 
can obtained using the transition function of A (line [38]) . 5{q,a) = {{i,d)} and 
j = i + d. Depending on the value of d, we have to consider either left predecessors 
{d = +1) or right predecessors {d = —1) of {r,j) (we call them "d-predecessors" , for 
short). First we try to find a state p greater than q such that {p, i) is a d-predecessor. 
If it does not exists then {q,i) is the last d-predecessor of (r, j). Hence (depending 
on d), we complete Mode 1 or Mode 2 for (r, j) (lines [40l441|) . Otherwise, we start to 
examine in Mode 1, with the same method, the next d-predecessor of {r,j) (lines [33]- 

To complete the description of the automaton A' , we have to specify that the initial state 
is q"li. In fact, we are interested in a segment from q' to q" on input w. By definition, 
such a segment should end in the configuration {q",0). This configuration does not have 
left predecessors. This permits us to state the depth-first search from the state q"ii, with 
the head on the left endmarker. If there exists the segment we are looking for, then the 
computation ends, as explained before, in the state qy (line [22]). Otherwise, it ends in some 
different state (line [35]) after traversing the whole subtree rooted at {q",0). 

We observe that the automaton A' so described has 4n — 3 states and it depends on 
the two states q' and q" we have fixed at the beginning. What happens if we change those 
states, i.e., if we consider them as parameters? 

The only parts of the construction that depend on q' and q" are the choice of the initial 
state and the transitions from the endmarkers and states q\ (lines [22Vl23p . We can modify 
A' by keeping, for each possible value of the parameter q', a table with the values of 6'{q\, h). 
During the execution A' will consult the appropriate table depending on the value of the 
parameter. All the other transitions of A' are unchanged. Furthermore, we can initialize A' 
in a suitable state, depending on the parameter q" . 

Hence, using 4n — 3 states (not including the space used to store the two parameters 
q',q"), we can implement HEACii{q' , q") with the following procedure (which also considers 
the trivial cases and which never modify the two parameters): 
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REACH(g', q") 

45 if q' = q" then accept 

46 else if q' ^ q" and S{q' , h) = then reject 

47 else if q' ^ q" and q" = qacc then 

48 if (gaccjO) G S{q',\-) then accept 

49 else reject 

50 else run A' from the state q"ii 



By summarizing, we have shown that given an n-state 20NFA A in the form of Lemma l3.lt 
using a finite state control consisting of 4n — 3 states we can decide whether or not for an 
input string w and two states q' and q" the automaton A has a computation segment from 
q' to q" on w, hence proving Lemma 13.21 



B. The subroutines nREACH and tREACH 

Given a state q" , nREACH((7") should return a nondeterministic chosen state q' such that 
there exists a computation segment from q' to q" on the given input string. If the subroutine 
is not able to find such a segment, due to the fact that such a state q' does not exist or due 
to wrong nondeterministic choices, then the subroutine halts in the state q-,, aborting the 
entire computation. 

The implementation can be given as a modification of that of Reach. In particular, 
nREACH(g") uses the backward search described in Section starting from the configura- 
tion Cf = {q" , 0). When a configuration c with the input head scanning the first input symbol 
is reached, i.e., c = {q, 1), then the subroutine nondeterministically selects one between these 
two actions: 

• Output a nondeterministically chosen state q' such that (g, +1) G 5{q',\-). This 
means that {q',0) is a left predecessor of (g, 1) and so there is a segment from q' to 
q". (If such a state q' does not exists, the computation is aborted in q,.) 

• Ignore the left predecessors of {q, 1) and continue the backward simulation. Hence, 
the machine try to detect a longer segment. 

These actions can be implemented by replacing the selection on lines [22] and [23l with a 
nondeterministic choice between these two possibilities: 

• q":= a nondeterministically choosen state q' such that {q,+l) G 5{q' ,\-) (if such a 
state does not exist the entire computation is aborted in q-,). 

6'{q\,a) := undefined 

This stop the computation by producing the output in the parameter, as a side effect. 

• 6'{q\,a) := {qU,+l) 

The computation can also traverse the whole subtree rooted at c/ = {q" , 0) without reaching 
the starting configuration of a segment. This can happen either because a segment ending 
in q" does not exist or because a wrong sequence of nondeterministic choices. In this case 
we have to halt in the state q-,, aborting the entire computation. In the implementation of 
Reach this case was managed in line [35j We need simply to change the assignment into 
5'{qi2,a) := g,. 

This implementation of nREACH uses the same set of states of Reach, with the exception 
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of qy, plus the "global" state q-,. Hence it can implemented with 4n — 3 states. We should 
also notice that it can produce, as a side effect, a modification of the parameter. 

The nondeteministic subroutine tREACH is implemented calling t times nREACH as 
shown in the following high-level codeQ 



tREACH(g",t) 

51 t := t 

52 while < > do 

q" := nREACH(g") 

55 if q" 7^ qo then halt in 



// possible side effect of nREACH: abort in g. 



*The (redundant) assignment on line[53]is to emphasize that nREACH(g") leaves its output in the param- 
eter. 
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